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Abstract Either we leave the definition of the halting function incomplete, not saying 
its result when applied to its own program, or we suffer inconsistency. If we choose 
incompleteness, we cannot require a halting program to apply to programs that invoke 
the halting program, and we cannot conclude that it is incomputable. If we choose 
inconsistency, then it makes no sense to propose a halting program. Either way, the 
incomputability argument is lost. 
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Introduction 


In 1936 Alan Turing wrote a paper [6] introducing a computing machine that we now call a 
“Turing Machine”. He showed how it can be programmed, and how different programs give 
the machine different behaviors, or in his terminology, create different machines. He 
showed how one such machine, a “Universal Turing Machine”, can be given a description of 
any Turing Machine (a program), and then simulate the operation of that Turing Machine 
(execute the program). That work remains an important part of the foundation of computer 
science. Page 247 of that paper is a proof that a certain problem that we now call the 
“Halting Problem” cannot be solved by computation. I want to re-examine that proof. 

The halting function is defined to say whether a program's execution terminates. In this 
paper I will use the word “function” to mean a mathematical total function, mapping every 
element of its domain to an element of its range. 

To apply a function to the domain of programs, Turing encoded programs as numbers. 
His programs were sequences of Turing Machine operations; each operation was encoded as 
a sequence of decimal digits, and each program was encoded by joining together the digit 
sequences representing the operations. In this paper I use programming notations that are 
typical of modern programming languages, rather than Turing Machine operations. Today, 
when programs are presented as input data to a compiler or interpreter, they are represented 
as texts (character strings), and that's the encoding I will use. These changes modernize and 
simplify the presentation without changing anything essential in the proof of 
incomputability. 

In some formulations of the Halting Problem, the halting function is applied to two 
operands: a representation of a program, and the initial state of the program variables (the 
input). The function tells whether the program's execution terminates when started in the 
initial state. The presentation of the Halting Problem can be simplified by eliminating the 
initial state (input) operand. One way to do that is to start execution of every program in the 
same initial state; if you wanted some other initial state, just begin the program with some 
initializing assignments to create the state you wanted. In other words, creation of the initial 
state you wanted is the initial part of the program. Turing did not distinguish the initial state 
of program variables from program, and I will follow Turing on that point. 


1 Eric Hehner 2011 


The proof is a “diagonal argument”, famously used by Georg Cantor [1] in 1890, and by 
Kurt Gödel [2] in 1931. In Turing's proof, the diagonalization is implicit in the self- 
referential definition of a program code to which he applies the halting function. 


Notations and Terminology 


For the purpose of this paper, we need a programming language that includes the following 
three kinds of programming statement: 

x:= E 

if B then P else Q fi 

while B do P od 
These are standard assignment, conditional, and loop statements found in most modern 
programming languages (except perhaps for minor syntactic differences). In the assignment 
statement, x is a program variable, and E is an expression of the same type as x. In the 
next two, B is a boolean expression, and P and Q are programming statements. The two 
boolean values are T (true)and L (false). By “boolean expression” I mean an expression 
whose overall type is boolean; subexpressions within it are not restricted to type boolean. 
Any programming language will include only a restricted choice of types and operators; for 
our purpose, all we need is call, and some way to return a result. I refer to any program 
fragment (statement or expression) as a “program”. 


Halting Problem 


Define function h: P—B , where P is the data type of texts that represent programs, and 
B is the boolean data type, so that when A is applied to a text representing a program 
whose execution terminates, the result is T , and when applied to a text representing a 
program whose execution does not terminate, the result is L . Here are two examples: 
AC x=0”) = T 
h (“ while T do x:= 10d”) = L 


Assume that h is computable, and that a program H for it has been written. Then 
H (“ x= 0 ») = T 
H (“ while T do x:= 10d”) = L 


Define program text D (for diagonal) as follows: 
D = “if H (D) then while T do x:= | od else x:= 0 fi” 


We place the definitions of H and D in a dictionary (or library) of definitions. When 
identifiers (like H and D ) occur within a program, they are interpreted (or understood, 
or given meaning) by looking them up in the dictionary. 


Argue: If H (D) is T ,then D represents a program that is equivalent to 

while T do x:= 1 od, whose execution does not terminate, and so H (D) is L. And if 
H (D) is L ,then D represents a program that is equivalent to x:= 0 , whose execution 
does terminate, and so H (D) is T . We have an inconsistency. 


Conclude: program H does not exist; function h is not computable. 
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There is a concern about the preceding proof: D is defined as a text, but is it a 
text that represents a program? We have supposed that if then else fi and while do od 
and T and assignment and function call are all in the programming language, but to 
know that 

if H (D) then while T do x:= 1 od else x:= 0 fi 

is a program, we need to know that H is being applied to an element of its domain. In other 
words, we need to know whether D represents a program, which was the question. The 
proof quietly makes the assumption that D does represent a program. Then, when the 
argument exposes an inconsistency, the cause could be this assumption, rather than the 
assumption of computability. I will repair this problem later in the section titled “Direct 
Proof”. 

The argument that there is an inconsistency was made informally, both in Turing's paper 
(see the first Appendix) and in the box above. Informality can hide problems, so I now write 
the argument as a formal proof. 


H (D) use definition of D 
= H (“ if H (D) then while T do x:= 1 od else x:= 0 fi”) semantic transparency 
= if H (D) then H (“ while T do x:= | od”) else H (“ x:=0 ”) fi given examples 
= if H (D) then L else T fi boolean algebra 
= ~H (D) 


The property “semantic transparency? means that if two programs are semantically 
equivalent, then function h gives the same result when applied to their representations. 
And therefore program H must also. The specific instance used here formalizes the 
argument that if H (D) is T , then the program 
if H (D) then while T do x:= 1 od else x:= 0 fi 

is equivalent to its then-part, and so H applied to the representation of that program must 
give the same result as when applied to the representation of its then-part. Likewise when 
H (D) is L, H applied to the representation of that program must give the same result as 
when applied to the representation of its else-part. 

Semantic transparency misses part of the computation. In the usual (eager) 
interpretation of if... then ... else ... fi , its execution terminates exactly when the following 
are true: 

e execution of the if-part terminates 

e ifthe if-part terminates with result T , execution of the then-part terminates 

e ifthe if-part terminates with result L , execution of the else-part terminates 

Semantic transparency missed the first bullet-point. Here is a proof that includes the missing 
piece. In it, I have used the words “function transparency” to say that when a function is 
applied to equal operands, the results are equal; it is a general property in mathematics, not 
specific to programs. The three bullet-points are called “interpretation”. 


T definition of D 
= D = “if H (D) then while T do x:= 1 od else x:= 0 fi” function transparency 
=> H(D) = H (“if H (D) then while T do x:= 1 od else x:= 0 fi ”) interpretation 
= H (D) = (H(“H (D)”) ^ if H(D) then H (“while T do x:=1 od”) else H (‘‘x:=0”) fi) 

given examples 
H (D) (H (“ H (D) ”) a if H (D) then L else T fi) boolean algebra 
H (D) = (AC A(D)”) a AA (D)) boolean algebra 
~H (“H(D)”) a ~H(D) 
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The last line is the uncomfortable conclusion that program H tells us both that execution of 
H (D) does not terminate and that execution of the program represented by D does not 
terminate. There is no formal inconsistency, and we cannot formally conclude that h is 
incomputable. I will repair this problem later in the section titled “Direct Proof”. 


Calumation Problem 


An argument identical to the Halting Problem proves that any property of program 
executions is incomputable. 


Define function h: PB , where P is the data type of texts that represent programs, and 
B is the boolean data type, so that when h is applied to a text representing a program 
whose execution calumates, the result is T , and when applied to a text representing a 
program whose execution does not calumate, the result is L . I don't define calumation; 
I just give two examples: 

ACx=0”) = T 

h(“x:=1”)= L 


Assume that h is computable, and that a program H for it has been written. Then 
H (“ x= 0 ”) = lr 
H (“x:=1”)= L 


Define program text D as follows: 
D = “if H (D) then x:= | else x:= 0 fi” 


Argue: If H (D) is T ,then D represents a program that is equivalent to x:= 1 , whose 
execution does not calumate, and so H (D) is L . And if H (D) is L ,then D 
represents a program that is equivalent to x:= 0 , whose execution does calumate, and so 
H (D) is T . We have an inconsistency. 


Conclude: program H does not exist; function h is not computable. 


All we need to know about A is that it has a positive and a negative example, which I chose 
arbitrarily. We don't even need to know what the positive and negative examples are; if we 
just know that there is at least one of each, we can make the same proof. And the range of 
the function need not be boolean; it can be any set with at least two values. According to 
this generalization of Turing's proof, every function applying to representations of programs, 
and having semantic transparency, is incomputable, with the possible exception of constant 
functions (functions that don't actually depend on their operands) [5]. 


Russell's Barber 


Before I discuss the Halting Problem further, I present a well-known “paradox” attributed to 
Bertrand Russell, in the hope that it will illuminate the Halting Problem. 
In a very small town there are exactly 3 men, named A, B,and C. The man named 
B is the barber. The barber shaves the men in the town who do not shave themselves. 
(Perhaps today it would make more sense to talk about cutting hair, but in Russell's day 
barbers also shaved men who did not shave themselves.) Formalizing, 
Vm E€ {A, B,C} (B shaves m) = =(m shaves m) 
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An immediate consequence is obtained by specialization: 
(B shaves B) = -=(B shaves B) 
and we have an inconsistency. 

This inconsistency surprises some people, and that's why it was called a “paradox”. 
Here, I think, is the source of the surprise. The sentence “The barber shaves the men in the 
town who do not shave themselves.” seems perfectly reasonable; that's what barbers do (or 
did in Russell's day). Anyone saying the sentence in earnest means “The barber shaves the 
other men in the town who do not shave themselves.”. Formally, 

Vm E {A, C} (B shaves m) = -=(m shaves m) 
The question whether B shaves B is left open, and can be answered either way without 
inconsistency. Similarly, someone might say “There's no-one here.”, and reasonable people 
understand that to mean “no-one but me”. Someone might say “She's smarter than anyone.”, 
or “I never eat fish, but on that one occasion I did.”, and reasonable people understand. But 
Bertrand Russell was a philosopher and logician; he took the sentence literally, and exposed 
the inconsistency, to the surprise of reasonable people. 

Is shaves computable? I am not seriously asking, but I give the question the same sort 
of proof that we have already seen twice. 


Define function shaves: MXM—B , where M is the men of the town, and B is the 
boolean data type, such that the barber shaves the men in the town who do not shave 


themselves. 


Assume that shaves is computable, and that a program SHAVES for it has been written. 


Argue: 

Vm € {A, B, Cy} (B SHAVES m) = -=(m SHAVES m) 
and by specialization 

(B SHAVES B) = -=(B SHAVES B) 
We have an inconsistency. 


Conclude: program SHAVES does not exist; function shaves is not computable. 


Initially, shaves is defined informally; a reasonable person understands it, and does not 
doubt its consistency. After the computability assumption, we use Russell's literal 
interpretation of the definition, including the question whether B SHAVES B . I do not 
intend this proof to be taken seriously; I use it to illustrate what happens when 
inconsistency is not suspected before a computability assumption, and then is discovered 
after. If we intend the characteristic property of shaves to apply toallof A, B,and C, 
then shaves is inconsistently defined, and its computability is not a sensible question. If we 
intend the characteristic property of shaves to apply to only A and C , then we should not 
require the characteristic property of its program counterpart SHAVES to apply to B. 
Either way, the conclusion is void. 

Could there be a hidden inconsistency in the definition of the halting function, without 
assuming computability? 


Oracle Halting Problem 


Let us look again at the Halting Problem, but this time without the assumption that h is 
computable. To do that, we augment the programming language with an oracle % that 
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magically tells whether a program in this augmented programming language has terminating 
behavior. The oracle's behavior is always terminating. The concept of an oracle within a 
program was invented by Turing [7] in his 1938 PhD thesis to study relative computability. 
It exactly serves our purpose: to provide the result of the halting function without concern 
for how, or even whether, it can be computed. 


Define function h: PB , where P is the data type of texts that represent programs in 
the augmented programming language, and B is the boolean data type, so that when h 
is applied to a text representing a program whose execution terminates, the result is T , 
and when applied to a text representing a program whose execution does not terminate, 
the result is L. Here are two examples: 

AC x=0”) = T 

h (“ while T do x:= 10d”) = L 


Let # be an oracle for h. Then 
H G x= 0 ”) = T, 
# (“ while T do x:= 1 od”) = 


Define program text D as follows: 
D = “if 7 (D) then while T do x:= 1 od else x:= 0 fi” 


Argue: If # (D) is T , then D represents a program that is equivalent to 
while T do x:= 1 od , whose execution does not terminate, and so #(D) is L. And if 
#(D) is 1 ,then D represents a program that is equivalent to x:= 0 , whose execution 
terminates, and so #(D) is T . We have an inconsistency. 


Conclude: ? 


The Oracle Halting Problem shows us that there is an inconsistency even without the 
assumption of computability. As before, we could substitute any property (e.g. calumation) 
in place of termination. But this time, without the assumption of computability, we cannot 
conclude its opposite. What should we conclude? that there cannot be an oracle for the 
halting function? 

The use of a magic oracle may cause the reader concern about the legitimacy of the 
result: if we assume magic, we might come to unreal conclusions. Indeed, if we had come 
to a positive conclusion of the form “Therefore we can compute X.”, it would be invalid. 
But we came to a negative conclusion: “even if we are given the result of the halting 
function, with no need to compute it, there is still an inconsistency”, and that conclusion is 
valid. To remove all concern, I now replace the oracle with its modern counterpart: 
specification. 


Specification Halting Problem 


In Turing's time, programs were commands to a computer. Today, they are also 
mathematical expressions in their own right [3][4]. Programs are specifications of computer 
behavior written in a restricted formal notation (a programming language) that a computer 
can execute. Not all specifications of computer behavior are programs. We might start a 
programming task with an informal, natural language specification of desired computer 
behavior, then formalize the specification using any mathematical notations that we find 
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useful, then refine the specification until we have a program. During refinement, the 
specification is a mixture of programming and other notations. Instead of imagining a magic 
oracle to use in a program, we can write a specification using the mathematical halting 
function. Here is the Specification Halting Problem. 


Define function h: SB ,where S is the data type of texts that represent specifications 
of computer behavior, and B is the boolean data type, so that when h is applied to a 
text representing a specification of terminating behavior, the result is T , and when 
applied to a text representing a specification of nonterminating behavior, the result is L. 
Here are two examples: 

ACx=0”) = T 

h (“ while T do x:= 1 0d”) = L 


Define specification text d as follows: 
d = “if h (d then while T do x:= 1 od else x:= 0 fi” 


Argue: If h(d) is T ,then d represents a specification that is equivalent to 

while T do x:= 1 od , which specifies nonterminating behavior, and so h (d) is L. And 
if h(d) is L ,then d represents a specification that is equivalent to x:= 0 , which 
specifies terminating behavior, and so h (d) is T . We have an inconsistency. 


Conclude: ? 


Moving from the original Halting Problem to the Specification Halting Problem, we 
augmented the domain of h from program texts to specification texts. But to arrive at 
inconsistency, all we really need is a domain consisting of three texts: the two program texts 
“x:=0” and “ while T do x:= 1 od” , and specification text d. Applying h to the two 
program texts is not problematic; the results are given as examples. The problem arises 
when we apply h to d. If we intend h to apply to all three of “x:=0”, “ while T do 
x:= 1 od” ,and d (in the Specification Halting Problem), then A is inconsistently defined, 
and its computability is not a sensible question. If we intend h to apply to only “x:=0” 
and “ while T do x:= 1 0d” , then we should not require its program counterpart H to 
apply to D (in the original Halting Problem). 


Mimic Halting Problem 


In the original Halting Problem, the role of the computability assumption has nothing to do 
with computability. To illustrate, I invent a new kind of function, which I will call “mimic 
functions”. Some functions have a corresponding mimic function; some do not. If a 
function has a corresponding mimic function, the function and its mimic have the same 
results. (If you cannot see any distinction between a function and its mimic, you are in good 
company. But bear with me.) When we ask whether the halting function has a mimic, here's 
what happens. 


Define function h: SB ,where S is the data type of texts that represent specifications 


of computer behavior, and B is the boolean data type, so that when h is applied to a 
text representing a specification of terminating behavior, its result is T , and when 
applied to a text representing a specification of nonterminating behavior, its result is 1 . 
Here are two examples: 
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h (S x= 0 ”) = T 
h (“ while T do x:= 10d”) = L 


Assume that function h has a mimic function H . Then 
H (“ x= 0 ») = T 
H (“ while T do x:= 10d”) = L 


Define specification text D as follows: 
D = “if H (D) then while T do x:= 1 od else x:= 0 fi” 


Argue: If H (D) is T ,then D represents a specification that is equivalent to 
while T do x:= 1 od , which specifies nonterminating behavior, and so H (D) is L. 
And if H (D) is L ,then D represents a specification that is equivalent to x:= 0 , which 
specifies terminating behavior, and so H (D) is T . We have an inconsistency. 


Conclude: mimic function H does not exist; function h is not mimicable. 


The Mimic Halting Problem talks about specifications, as does the Specification Halting 
Problem, but its structure is now identical to the original Halting Problem. The role of the 
mimic function H in this proof is to take the blame, protecting the original function h from 
blame. 

As hinted in parentheses before the proof, there is no distinction between a function and 
its mimic, so the Mimic Halting Problem is bogus. Likewise, in the original Halting 
Problem, function h and program H are both treated as mappings from the same domain 
elements to the same range elements, giving the same results. There is no difference 
between them that is used in the proof. In the original Halting Problem, the only role of 
program H is to take the blame, protecting the mathematical function h from blame. 


Nature of Computation 


There is an important difference between a mathematical function and a program. A 
program specifies a physical process that progresses through steps that take resources, in 
particular, time, and in the extreme, may take infinite time (not producing a result). A 
mathematical function is unconcerned with any process by which the result is produced; it 
is just a mapping from a domain to a range. In general, programs cannot be characterized as 
total mathematical functions from initial state to final state, with no mention of time. They 
have been characterized as total functions to a range augmented with an extra element that 
stands for nontermination. They have been characterized as partial functions. And they 
have been characterized as functions or relations with a time component. The time 
component may be as simple as a boolean variable that distinguishes finite from infinite time 
[4]. Or it can be a numeric variable (integer or rational or real) extended with an infinite 
value to account for nontermination [3]. 

Turing distinguished infinite and finite execution time, using the words “circular” and 
“circle-free”. After arguing that H (D) can produce neither T nor L due to inconsistency, 
Turing concluded that H (D) is circular, producing no result, contrary to the computability 
assumption. The computability assumption is that function h can be computed by program 
H . But the argument that follows that assumption makes absolutely no use of the 
computational nature of H . It treats H as a total function, just as if it were h. It applies 
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H to the same domain elements, producing the same results that h would produce. 
Program H mimics mathematical function A , and its only role is to take the blame. 

We have now seen function h , oracle # , and program H , all applying to texts that 
represent programs, all telling us whether execution terminates. Oracle # seems to be 
intermediate between h and H in the sense that, like h it may not be computable, like H 
we can use it in a program. The proof makes no use of any distinction among them. Both 
%# and H mimic h. Since h, #,and H are the same, the three diagonal definitions 


d = “if h(d) then while T do x:= 1 od else x:= 0 fi ” 
D = “if? (D) then while T do x:= | od else x:= 0 fi” 
D = “if H (D) then while T do x:= | od else x:= 0 fi” 


are the same. If, under the computability assumption, D is well-defined, then, without any 
computability assumption, D and d must be just as well-defined. And the same argument 
can be made using any of these diagonal definitions: h (d), #(2),and H (D) can be 
neither T nor due to inconsistency. The inconsistency is not due to a computability 
assumption. 

The combination of h and d is inconsistent; to restore consistency, we withdraw the 
definition of d . Likewise the combination of # and D is inconsistent; to restore 
consistency, we withdraw the definition of D . Likewise the combination of H and D is 
inconsistent; to restore consistency, we withdraw the definition of D . 


Simple Problem 


Define function h: N-B as a function from naturals to booleans. Here are two 
examples: 

h(O) = T 

AQA)= 1 


Assume that h is computable, and that a program H for it has been written. Then 
H(O) = T 
H(1)= L 


Define natural D as follows: 
D = if H (D) then 1 else 0 fi 


Argue: If H (D) is T ,then D is 1 ,and so H (D) is L. Andif H (D) is L,then D 
is 0 ,and so H (D) is T . We have an inconsistency. 


Conclude: program H does not exist; function h is not computable. 


Function h could be the halting function applied to programs encoded as numbers, as in 
Turing's paper. Program number O could be one whose execution terminates, and program 
number 1 one whose execution does not terminate. Thus this proof could be talking about 
the halting function. But this time I omitted all the words describing what function h is 
supposed to be, so h could be any function that maps O to T and 1 to L. Perhaps h is 
the even function. And this proof comes to the absurd conclusion that such a function is 
incomputable. (The difference between the Simple Problem and the original Halting 
Problem is a change in the coding levels in the definition of D .) 

This proof does show that the combination of definitions of H and D is inconsistent. 
If we also define 
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d = ifh(d then | else 0 fi 
we similarly find that the combination of definitions of and d is inconsistent. There is 
no difference between h and H that is used in the argument. So there is also no difference 
between the definitions of d and D. If we intend hA to apply to d , we have an 
inconsistency before we get to the assumption of computability. If we withdraw the 
definition of d to restore consistency, then we should not ask H to apply to D. 


Direct Proof 


The form of proof we have been looking at so far is “proof by contradiction”. We make a 
computability assumption, then we discover an inconsistency or contradiction, and we 
conclude that the computability assumption was wrong. But it is possible that the 
inconsistency was already there before the computability assumption, or that the 
inconsistency was due to a later unstated assumption. Here is a Direct Proof, with no 
computability assumption. 


Let < be a programming language that includes the text (character string) data type and 
the boolean data type. Let h (the halting function) be a function that applies to texts that 
represent programs in <. When h is applied to a text representing a program whose 
execution terminates, the result is T , and when applied to a text representing a program 
whose execution does not terminate, the result is L. In < , define 

T = (aprogram whose execution terminates) 

N = (a program whose execution does not terminate) 

P = (a program that applies to texts with boolean result) 
D = “if P(D) then N else T fi” 


As part of program compilation/interpretation, definitions are placed in a dictionary. Also 
as part of compilation/interpretation, these definitions are used to supply meaning for the 
identifiers T, N, P,and D whenever they are used in a program. 


Program P applies to all texts, regardless of whether they represent programs. Therefore 
if P (D) then N else T fi 
is a program, and so D isa text representing a program. Calculate 


definition of D 

“if P (D) then N else T fi” function transparency 

h (D) h (“if P (D) then N else T fi ”) interpretation 

h(D) = (h (“P (DYP) ^ if P (D) then h (“N”) else h (‘T”’) fi) given examples 

h (D) (h (“P (DY’) ^ if P (D) then L else T fi) boolean algebra 
h (D) = (h (“P (DY) 0 =P (D) 


For program P to be an implementation of function h, P has to apply to at least the 
domain of h (it does), and on the domain of h it has to give the same results. Suppose 
h (D) is T . Then the last line of the calculation h (“P (D)’) a ~ P (D) means that 
execution of P (D) terminates with result L , and so P is not an implementation of h. 
Suppose h (D) is L. Then, according to the last line of the calculation, either execution 
of P (D) does not terminate, or P (D) is T ,and so again P is not an implementation 
of h . In all cases, P is not an implementation of h . Since P was any program from 
texts to booleans, the halting function is incomputable. 
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Proofs of incomputablility are essentially classical (nonconstructive). That's clear in this 
Direct Proof: we don't know whether h (D) is T or L , but either way, it differs from 
P(D). 

The first criticism of the Turing proof, or rather, of our modern formalization of it, was 
that we cannot say whether the diagonal definition defines a representation of a program. 
Strangely, by insisting in the Direct Proof that P applies to all texts, not just to texts that 
represent programs, we can be certain that D represents a program; that solves the 
problem, even though D is the only argument we want to apply P to. 

The second criticism of the Turing proof was that semantic transparency did not take 
full account of the interpretation of if-then-else-fi ; specifically, execution of the if-part 
must terminate. The Direct Proof does not have that flaw. 

The Turing proof made a computability assumption, and concluded that this particular 
assumption was the cause of inconsistency. The Direct Proof makes no computability 
assumption. It introduces P as any program, and concludes that P cannot be an 
implementation of the halting function. 

The Direct Proof has escaped all previous criticisms except the main one. Suppose 
language < includes the halting oracle. Since P is any program from texts to booleans, it 
could be the halting oracle (extended to give result L (or anything) when applied to a text 
that does not represent a program). The Direct Proof concludes that the halting oracle 
differs from the halting function on program text D . But the halting oracle is defined to 
agree with the halting function on all program texts. So we have an inconsistency between 
the halting oracle and the diagonal definition. Since the halting oracle is identical to the 
halting function, not even differing by consuming resources, the inconsistency, as always, is 
between the halting function and the diagonal definition. 

To avoid the word “oracle”, let P be any specification that applies to texts with a 
boolean result, require the halting function to apply to one diagonally defined specification, 
and conclude that P differs from h. But P could be h. Therefore there is an 
inconsistency between the halting function and the diagonal definition. 


Reversed Halting Problem 


The source of the inconsistency can be seen most clearly by reversing the process: this time 
I start with the inconsistency, and build toward the halting problem. Suppose we are 
interested in calumation, which is said to be a property of program executions. I begin with 
the following informal “definition”. 


Let D bea program with the property that if its execution calumates, then its 
execution does not calumate; and if its execution does not calumate, then its 
execution calumates. 


The word “definition” was placed in quotation marks because someone might very 
reasonably object that I have not defined anything; there cannot be such a program. 
Someone else might say that it is a definition because it has the form of a definition, even 
though it is an inconsistent definition. It is not important to me to decide between these two 
viewpoints; what is important to me is that we all agree that there is an inconsistency. I will 
say it is a definition only because it makes my English sentences more direct. 

At this stage we have only an inconsistent definition, not yet any program purported to 
determine calumation. The inconsistency is far too obvious, so I now begin a process of 
obfuscation, being careful to maintain the inconsistency. 
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Let C be a program whose execution calumates. 

Let N be a program whose execution does not calumate. 
Now I can make the definition of D a little more formal. 

D = if execution of D calumates then N else C fi 
This is almost a normal recursive definition in any programming language. To finish making 
it so, 

Let H be a boolean expression (or boolean function of no arguments) that tells 

specifically whether execution of D calumates. 
Now we define 

D = if H then N else C fi 
I will obfuscate a little more in a moment, but I want to take stock now. The inconsistency 
is still there. It was not introduced by H . But now we can start to shift the blame from D 
to H. We can say that D would be a perfectly good definition in any programming 
language, except that there cannot be such an expression as H . That's because we ask H 
to perform an impossible task. If it says T then D=N and so it should say L; if it says L 
then D=C and so it should say T . The problem is not that H must compute a well- 
defined but incomputable function. The problem is that H has an inconsistent specification. 

My next obfuscation is to make H more general. I want to give it a program parameter 
so that I can show it working on some programs, like C and N ,and so make it seem more 
reasonable. But to do that, I first have to encode programs as data. So I redefine 

D = “if H (D) then N else C fi” 
Now D is completely innocent: it is just a text (character string). And I redefine H to 
have one text parameter p so that 

H(p)=T if text p represents a program whose execution calumates 

H (p)= L iftext p represents a program whose execution does not calumate 

H (p)= L (or T , it doesn't matter which) if text p does not represent a program 
Now I can proudly display H (“C”’)=T and H (“N”’) = L , showing how reasonable the 
specification of H is, before we consider the more difficult question of H (D) . 

I have been using a simplified kind of program that has no input, or dependence on 
initial state. I have argued that input, or initial state creation, can be moved to the initial part 
of the program without loss of generality. Now I unsimplify. I redefine H to have two 
parameters: 

H (p,q) is a boolean expression that tells whether execution of the program 

represented by text p calumates when provided with input text q 
I must also redefine D : 

D = “if H (D, D) then N else C fi” 
For each parameter p and q ,there is only one text D that we supply as argument, which 
is why parameterization was unnecessary. But the occurrence of “ H (D, D) ” motivates the 
name “diagonal”: the parameters (p, q) give us a two dimensional space, and point (D, D) 
is on its diagonal. 

Finally, we make the well-known argument: if H (D,D) is T then D represents a 
program equivalent to N and so H (D,D) shouldbe L; if H (D,D) is L then D 
represents a program equivalent to C and so H (D, D) shouldbe T . 

The definitions are now sufficiently complicated to hide the source of the inconsistency. 
So, before the argument showing inconsistency, we slip in an assumption, and after the 
argument showing inconsistency, we conclude its opposite. Any assumption will do; we 
could assume there is life on other planets, but that would be too obviously irrelevant. So 
we assume that calumation is computable because that looks like it might be relevant, and 
then conclude that calumation must be incomputable. 
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We have been talking about calumation, which has not been defined. To be more 
definite, I now replace calumation by halting, and conclude that halting is incomputable. 
This proof is the same as Turing's, except that we are using a modern programming language 
rather than Turing Machine operations, and we are using a text encoding rather than a 
numeric encoding. But this “proof” that halting is incomputable actually has nothing to do 
with halting, and nothing to do with computability. 


Self-Duplicating Program 


Diagonalization was Turing's method of creating a self-reference. We have been using the 
modern programming construct of recursive call for that purpose. Let me characterize a call 
by this little conversation: 

“What computation are you calling, please?” 

“The one named “P” .” 
If P is the name of the program (or specification) in which this answer occurs, then it is a 
recursive call. Another way to answer the question is directly. For example, 

“What computation are you calling, please?” 

“This one: “x:= 0” .” 
To make a direct answer recursive (self-referential) requires a trick usually attributed to 
W.V.O.Quine: we make the answer be a text expression that evaluates to a representation of 
the program (or specification) in which it occurs. 


Define 
oQ = (the opening quotation mark) 
cQ = (the closing quotation mark) 
T = (a program whose execution terminates) 
N = (a program whose execution does not terminate) 
H (p) = (a program that implements the halting function) 


D (x,y,z) = xoQxcOyoQycQyoQzcQz 


Every programming language provides some way of writing the text that consists of a 
quotation mark (perhaps by writing it twice, or perhaps by preceding it with a backslash), 
and I ask you to fill in the definitions of oQ and cQ with whatever your favorite 
programming language uses. For T choose any program whose execution terminates, and 
for N choose any program whose execution does not terminate. The ability to define H as 
a program is the computability assumption. And finally, program D has three text 
parameters and produces a text result by catenation. 
After these definitions, the program 


if H (D (“ if H (D (’’,“,”, “ )) then N else T fi ”)) then N else T fi 
is very interesting because H is being applied to an argument 
D (“if H (D (”,“;,“ )) then N else T fi ”) 


that evaluates to the text representing the program itself. (You should try evaluating this 
expression to see for yourself.) In other words, the program has the form 
if H ( 2 ) then N else T fi 

where 7 is a text expression whose value represents the program itself. So if execution of 
this program terminates, then it is equivalent to N whose execution does not terminate. 
And if execution of this program does not terminate, then it is equivalent to T whose 
execution does terminate. We have an inconsistency. The usual conclusion is that there 
cannot be any way to define program H , and the halting function is incomputable. 
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Suppose the language includes the halting oracle, and we use it as the definition of H . 
This time, we are not supposing that the halting function is computable. But we come to the 
exact same inconsistency. 

Alternatively, drop the definition of H , and notice that the specification 

if h (D (“if h (DC, “,”, “ )) then N else 7 fi ”)) then N else T fi 
has the form 

if h ( S ) then N else T fi 
where S is a text expression whose value represents the specification itself. Again, we are 
not supposing that the halting function is computable, but we come to the exact same 
inconsistency. Evidently, the inconsistency is not due to the computability assumption. 


Bounded Halting Problem 


One way to define the halting function h is by means of the bounded halting function b . 
Define b (p, n) to say whether execution of the program represented by text p terminates 
within n steps, or time units. Function b can be defined without inconsistency, and it is 
computable. Then we define h as 

hp) = da: N b@,n) 
where N is the natural numbers. 

Program text can refer to any program name that has been defined (and placed in the 

dictionary). If a program B has been written for function b , and we define D as 

D = “if B(D,n) then while T do x:= 1 od else x:= 0 fi” 
there is still no problem because calls add to the step count, and B terminates its execution 
when the step count exceeds n . The step count, or execution time, accounts for the 
computational nature of programs. But when h is defined as above, the existential 
quantification hides the step count, reporting only whether it is finite or infinite, and so we 
lose most of the computational nature of programs. Now, when d is defined as 

d = “if h(d) then while T do x:= 1 od else x:= 0 fi” 
we have the same inconsistency as before. The pair of definitions ( h , d ) is inconsistent. 


Automated Proof 


Robert Boyer and J Moore completely formalized and verified a proof of the incomputability 
of the halting function using an automated prover [0]. They use a constructive logic in 
which all recursions must be well-founded to ensure termination. They define the bounded 
halting program B (p,n) , but they cannot define the halting program 

H(p) = da: N B(p,n) 
because they lack quantification over an infinite domain. 

In place of Turing Machine operations, they use LISP programs, which are defined by 
writing a bounded EVAL function to interpret LISP. When execution of p runs past n 
steps, EVAL (p,n) returns the result BTM . So “execution of p fails to halt” becomes 

Vn: N: EVAL (p,n) = BTM 
which cannot be expressed due to the unbounded quantification, but which can be proven by 
induction for any choice of nonterminating p . 

In place of a numeric encoding of programs, they use a textual encoding, as does this 
paper. And they define a diagonal element CIRC using recursion, very similar to the 
diagonal definition of D in this paper. The theorem they prove, paraphrased roughly, says: 
If a program named HALTS behaves like the halting function (returning T for programs 
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that halt and F for those that don't), then HALTS applied to CIRC returns BTM . This is 
inconsistent, and they conclude that halting is incomputable. 

The Boyer-Moore proof can be challenged as follows. Let x and y be natural 
variables, and let x:= ? mean that x is nondeterministically assigned an arbitrary natural 
number. There are many ways to implement nondeterministic assignment, so we can 
include it in a programming language. We have to decide whether EVAL is an interpreter, 
making a particular choice of value, or a semantics, making all choices of value. Either way, 
the problem is that natural induction fails to classify executions of the program 

x:=?; while x>0 do x:= x-1; y:=?; while y>0 do y:= y-1 od od 
as always terminating. 

If we exclude nondeterminism from programs, then Boyer and Moore proved what 
Turing proved, and what is proved in the Halting Problem box at the beginning of this paper. 
Like Turing, they did not question whether the mathematical halting function is inconsistent 
if it is required to apply to a diagonally constructed element. They show that a program with 
exactly the same properties is inconsistent, and conclude incomputability of an unchallenged 
mathematical function. 


Incomputable by Reduction 


Many functions have been proven to be incomputable by reduction (directly or indirectly) to 
the halting function. The reduction argument goes like this: if f were computable, then 
here is how we could use f to compute the halting function; but since the halting function 
is incomputable, f must be too. I make no challenge to the validity of the reductions, but I 
suggest they may be proving something different. They may be proving: if the definition of 
f were consistent, then here is how we could use f to compute the halting function; but 
since the definition of the halting function is inconsistent, the definition of f must be too. 
The inconsistency of f may be due to the application of f to a diagonally defined element 
that was unintended in the informal definition of f , just like the halting function. 


Conclusion 


In Turing's day, mathematics was a well established subject, and computation was very new. 
So it seemed reasonable to ask: which mathematical functions are computable? A computer 
scientist might ask the reverse question: which programs can be characterized as total 
functions from initial state to final state? In other words, which programs have terminating 
computations? Without interaction, termination is essential, otherwise there is no output. 
Total functions from initial state to final state are a reasonable characterization of 
noninteracting terminating computation. But the question of termination has lost most of its 
interest in recent years. Today, computation is usually interactive and potentially infinite; 
termination occurs when you click on “quit”. Today, total functions from initial state to final 
state are inadequate to characterize computation. 

Even for noninteracting computation, it is worthless to specify or guarantee or require 
termination. If a specification says that the result is 2 and we observe that the result is 3 , 
we have grounds for complaint. If a specification says that execution is nonterminating and 
we observe termination, again we have grounds for complaint. If a specification says that 
execution is terminating (but gives no time bound), there is never a time when we can 
observe the computation violating the specification. Specifying or guaranteeing or requiring 
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termination (with no time bound) is worthless because it does not rule out any observation. 
It seems to me that theoretical computer science began badly by its emphasis on termination. 

Turing said “what I shall prove is quite different from the well-known results of Gödel” 
[6 p.259 referring to 2], but I think it is the same result. Gödel showed that if a logical 
formalism is expressive enough to encode its own interpreter, then the formalism is either 
inconsistent or incomplete. From a programmer's point of view, if we apply an interpreter to 
a program text that includes a call to that same interpreter with that same text as argument, 
then we have an infinite loop. A halting program has some of the same character as an 
interpreter: it applies to texts through abstract interpretation. Unsurprisingly, if we apply a 
halting program to a program text that includes a call to that same halting program with that 
same text as argument, then we have an infinite loop. A mathematical version of it cannot 
escape the corresponding problem: either we leave the definition of the halting function 
incomplete, not saying its result when applied to its own program, or we suffer 
inconsistency. If we choose incompleteness, we cannot require the program version to apply 
to texts that invoke the halting program, and we cannot conclude that it is incomputable. If 
we choose inconsistency, then it makes no sense to propose a program version. Either way, 
the incomputability argument is lost. 
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Appendix: Turing's Words 


Here is the key paragraph from Turing's paper. To help the modern reader, I have added the 
square bracketed words. 


Let us suppose that there is a such a process; that is to say, that we can invent a machine D 
which, when supplied with the S.D [standard description] of any computing machine M will 
test this S.D and if M is circular [prints a finite sequence] will mark the S.D with the symbol 
"u" [unsatisfactory] and if it is circle-free [prints an infinite sequence] will mark it with 
"s" [satisfactory]. By combining the machines D and U [universal machine, or interpreter] 
we could construct a machine H to compute the sequence beta' [a sequence that differs from 
the diagonal with U]. ... Now let K be the D.N [description number] of H. What does H do 
in the Kth section of its motion? [What happens when H works on the representation of H?] 
It must test whether K is satisfactory, giving a verdict "s" or "u". Since K is the D.N of H 
and since H is circle-free, the verdict cannot be "u". On the other hand, the verdict cannot 
be "s". For if it were, then in the Kth section of its motion H would be bound to compute the 
first R(K-1)+1 = RCK) figures [R(n) is the number of terminating programs among the first n 
programs] of the sequence computed by the machine with K as its D.N and to write down 
the R(K)th as a figure of the sequence computed by H. The computation of the first R(K)-1 
figures would be carried out all right, but the instructions for calculating the R(K)th would 
amount to "calculate the first R(K) figures computed by H and write down the R(K)th". This 
R(K)th figure would never be found. I.e., H is circular, contrary both to what we have found 
in the last paragraph and to the verdict "s". Thus both verdicts are impossible and we 
conclude that there can be no machine D. 


